Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcriptions of spoken utterances

نویسندگان

  • Mikko Kokkonen
  • Kari Torkkola
چکیده

Two schemes to obtain phonemic transcriptions of spoken utterances are described and compared. Both schemes utilize the so called Self-Organizing Kohonen Maps first to vector quantize speech into a sequence of phoneme Iabels centisecond apart. In the original scheme, this quasiphoneme sequence is converted into a phoneme string with simple durational transformation rules. In the scheme introduced in this paper, the conversion is carried out by using a multi-layered feed-forward network trained with error back propagation. The achieved phonemic recognition error rate is about 2.5 per cent units better with the multi-layered network approach (19.2% opposed to 21.7%). However, the back propagation algorithm requires a vast amount of training compared to the rule-based method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Signal Prediction by Layered Feed - Forward Neural Network (RESEARCH NOTE).

In this paper a nonparametric neural network (NN) technique for prediction of future values of a signal based on its past history is presented. This approach bypasses modeling, identification, and parameter estimation phases that are required by conventional parametric techniques. A multi-layer feed forward NN is employed. It develops an internal model of the signal through a training operation...

متن کامل

Steel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps

Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...

متن کامل

An Empirical Comparison of DimensionalityReduction Techniques for Pattern Classi

To some extent or other all classiiers are subject to the curse of dimensionality. Consequently, pattern classiication is often preceded with nding a reduced dimensional representation of the patterns. In this paper we empirically compare the performance of unsupervised and supervised dimensionality reduction techniques. The data set we consider is obtained by segmenting cells in cytological pr...

متن کامل

The AUTONOMATA Spoken Names Corpus

In the Autonomata project we have collected a corpus of spoken name utterances with manually corrected phonemic transcriptions of these utterances. The corpus was designed with the intention to become a major resource for the development of automatic speech recognition engines that can achieve a high accuracy on the recognition of person and geographical names spoken in Dutch. The recorded name...

متن کامل

Self-organizing Maps and Ancient Documents

This paper presents how Self-Organizing Maps and especially Kohonen maps can be applied to digital images of ancient collections in the perspective of valorization and diffusion. As an illustration, a scheme of transparency reduction of the digitized Gutenberg Bible is presented. In this two steps method, the Kohonen map is trained to generate a set of test vectors that will train in a supervis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 9  شماره 

صفحات  -

تاریخ انتشار 1989